Classifying Evolving Data Streams for Intrusion Detection

نویسندگان

  • Mohammad M. Masud
  • Jing Gao
  • Latifur Khan
  • Jiawei Han
چکیده

Stream data classification is a challenging problem because of two important properties: its infinite length and evolving nature. Traditional learning algorithms that require several passes on the training data are not directly applicable to stream classification problem because of the infinite length of the data stream. Data streams may evolve in several ways: the prior probability distribution p(c) of a class c may change, or the prior probability of observing an example p(x) may change, or both probabilities may change. In either case, the challenge is to build a classification model that is consistent with the current concept. As a result, special techniques are required to classify evolving data streams. Network traffic can be considered as a data stream having both abovementioned properties. Thus, network intrusion detection can be considered as a stream classification problem, where each data point can be an intrusion or benign. A data point may represent a connection, or a sequence of N network packets etc.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Some Clustering Algorithms to Enhance the Performance of the Network Intrusion Detection System

Most current intrusion detection systems are signature based ones or machine learning based methods. Despite the number of machine learning algorithms applied to KDD 99 cup, none of them have introduced a pre-model to reduce the huge information quantity present in the different KDD 99 datasets. Clustering is an important task in mining evolving data streams. Besides the limited memory and one-...

متن کامل

Categorizing Concepts for Detecting Drifts in Stream

Mining evolving data streams for concept drifts has gained importance in applications like customer behavior analysis, network intrusion detection, credit card fraud detection. Several approaches have been proposed for detection of concept drifts in the context of supervised learning in data streams. Recently, researchers have been looking into the problem of identifying concept drifts in unlab...

متن کامل

Classifying Evolving Data Streams Using Dynamic Streaming Random Forests

We consider the problem of data-stream classification, introducing a stream-classification algorithm, Dynamic Streaming Random Forests, that is able to handle evolving data streams using an entropy-based drift-detection technique. The algorithm automatically adjusts its parameters based on the data seen so far. Experimental results show that the algorithm handles multi-class problems for which ...

متن کامل

Mining Evolving Streams with Resource Adaptive Computation

The problem of streaming data has gained importance in recent years because of advances in hardware technology. The ubiquitous presence of data streams in a number of practical domains has generated a lot of research in this area. Example applications include surveillance for terrorist attack, network monitoring for intrusion detection, and others. Problems such as data mining which have been w...

متن کامل

A Novel High Dimensional and High Speed Data Streams Algorithm: HSDStream

This paper presents a novel high speed clustering scheme for high-dimensional data stream. Data stream clustering has gained importance in different applications, for example, network monitoring, intrusion detection, and real-time sensing. High dimensional stream data is inherently more complex when used for clustering because the evolving nature of the stream data and high dimensionality make ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009